Abstract



We examine the long-term asymptotic behavior of dissipating solutions to aggregation equations
and Patlak-Keller-Segel models with degenerate power-law and linear diffusion. The purpose
of this work is to identify when solutions decay to the self-similar spreading solutions of
the homogeneous diffusion equations. Combined with strong decay estimates, entropy-entropy
dissipation methods provide a natural solution to this question and make it possible to derive
quantitative convergence rates in $L^1$. The estimated rate depends only on the nonlinearity of
the diffusion and the strength of the interaction kernel at long range.

1 Introduction 

The most widely studied mathematical models of nonlocal aggregation phenomena are the Patlak- 
Keller-Segel (PKS) models, originally introduced to study the chemotaxis of microorganisms [40|, 
1281 1251 [23] . Similar models are also used to study the formation of herds and flocks in ecological 
systems |1 1 1. H9| [37| 123] . A common theme is the competition between the tendency for organisms 
to diffuse, e.g. under Brownian motion or to avoid over-crowding, and for organisms to aggregate 
into groups through nonlocal self-attraction. The parabolic-elliptic PKS models are a subclass of 
the general aggregation-diffusion equations 

$$u_t + \nabla \cdot (u \nabla \mathcal{K} * u) = \Delta A(u). \tag{1}$$

The local and global existence and uniqueness of solutions to models such as (1) are well studied (see for instance
El El QUI 133 SSI 133 EQ]). However, less is known about the long-term qualitative behavior 
of solutions. In this work, we are interested in examining the asymptotic profiles of dissipating
solutions to (1) in the special case

$$\begin{aligned} u_t + \nabla \cdot (u \nabla \mathcal{K} * u) &= \Delta u^m, \quad m \geq 1, \\ u(0, x) &= u_0(x) \in L^1_+(\mathbb{R}^d; (1 + |x|^2)dx) \cap L^\infty(\mathbb{R}^d), \end{aligned} \tag{2}$$

where $L^1_+(\mathbb{R}^d; \mu) := \{f \in L^1(\mathbb{R}^d; \mu) : f \geq 0\}$. In particular, we are interested in determining when
solutions to (2) converge in $L^1(\mathbb{R}^d)$ as $t \to \infty$ to the self-similar spreading solutions of the diffusion
equation

$$u_t = \Delta u^m. \tag{3}$$

All dissipating solutions converge weak* to zero as $t \to \infty$, but this kind of result implies that
for $1 \ll t < \infty$, the dissipating solutions all look more or less like self-similar solutions of (3). For
this reason, these results are often referred to as intermediate asymptotics.
Supercritical problems are those in which the aggregation is dominant at high concentrations,
subcritical problems are those in which the diffusion dominates at high concentrations, and critical 
problems are those in which the effects are in approximate balance. It is known that supercritical 
problems exhibit finite time blow up for solutions of arbitrarily small mass and subcritical problems 
have global solutions [Ml HU [5J [8] . The critical case is more interesting; data with small mass exists 
globally, whereas finite time blow up is possible for large mass [U El [TUl 146] . In this work, we will 
refer to the case $m < 2 - 2/d$ as supercritical and $m = 2 - 2/d$ as critical. This is in contrast to the
definition used in [5] , where the critical diffusion exponent was taken to depend on the singularity of 
the kernel. Here, achieving such a precise balance is not the primary interest and moreover we are 
concerned with examining the limit of low concentrations. In the sense of [H (U [46], |45j , m = 2 — 2/d 
is the critical exponent for the Newtonian potential, which is the most singular kernel known to 
have unique, local-in-time solutions [5]. 
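
The regime terminology used in this work can be sketched in a few lines of Python. This is only an illustrative helper: the function name `classify_regime` and the floating-point tolerance `tol` are our own assumptions, not taken from the references. It compares $m$ with the exponent $2 - 2/d$, following the convention adopted here (and not the kernel-dependent convention of [5]).

```python
def classify_regime(m: float, d: int, tol: float = 1e-12) -> str:
    """Classify the diffusion exponent m in dimension d against 2 - 2/d.

    Hypothetical helper: the convention follows the text, where
    m < 2 - 2/d is supercritical and m = 2 - 2/d is critical.
    """
    if d < 2:
        raise ValueError("the text assumes d >= 2")
    if m < 1:
        raise ValueError("the text assumes m >= 1")
    m_crit = 2.0 - 2.0 / d
    if abs(m - m_crit) <= tol:
        return "critical"
    return "supercritical" if m < m_crit else "subcritical"
```

For example, linear diffusion ($m = 1$) is critical in dimension $d = 2$ and supercritical for every $d \geq 3$.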

As strong nonlinearities vanish quickly near zero, scaling heuristics suggest that the nonlocal 
aggregation term should become irrelevant for small data in the critical and supercritical regime. 
We use entropy-entropy dissipation methods [181 EH CEH1 EE El E] to obtain several intermediate 
asymptotics results which show this to be true, and that solutions of (2) converge to self-similar
solutions of (3). Entropy-entropy dissipation methods are well-suited for proving the convergence
to equilibrium states of nonlinear Fokker-Planck-type equations for arbitrary data [18, 15]. Through
a change of variables employed below, this also provides convergence to self-similarity of nonlinear
homogeneous diffusion equations [19]. In contrast to these works, we employ such methods to
prove a small data result, treating the nonlocal aggregation term as a perturbation. For this 
to work, sufficiently strong decay estimates on the solution must be obtained. Indeed, strong 
decay estimates imply the intermediate asymptotics results, and so we have chosen to state them
separately in Theorem 1 below. Here, we obtain these estimates using iteration methods, discussed
in more detail below, which are a refinement of the local theory of §2§ (see e.g. [U [5]). While 
nonlinear, they are essentially perturbative in nature and thus somewhat limited against arbitrary 
data, using basic dissipation estimates to over-power the nonlocal advection term only under certain 
conditions. Analogous to related models, such as the nonlinear Schrödinger equations, it is likely a
fully non-perturbative theory will need to be applied in order to treat large data, which is sometimes 
significantly more difficult (see for instance [Ml [29]). More details and discussion about the results 
and the methods of the proofs are given below in §1.1 and §1.2.

The first of our intermediate asymptotics results, Theorem 2, covers the case $\mathcal{K} \in W^{1,1}(\mathbb{R}^d)$.
Here, the nonlocal term can be considered to have a finite characteristic length-scale which becomes
vanishingly small relative to the length-scale of the solution as it dissipates. A result similar to
Theorem 2 for $L^p$, $1 < p < \infty$, was proved for the special case of the Bessel potential in [34],
[35] with the soft compactness method of [27] (see also [ST]). In contrast to methods based on 
compactness, the entropy-entropy dissipation methods obtain quantitative convergence rates in $L^1$,
which, by interpolation against the decay estimates, provide convergence in all $L^p$, $1 \leq p < \infty$. For
supercritical problems, the convergence rate is shown to be the same as the optimal rates for (3)

[II EH H Eg EE]. 

In general, if the kernel does not have critical scaling at large length-scales, the long-range effects
should still become irrelevant as the solution dissipates. That is, we should expect results similar
to the $\mathcal{K} \in W^{1,1}(\mathbb{R}^d)$ case to hold, except when $m = 2 - 2/d$ and $\nabla\mathcal{K} \sim |x|^{1-d}$ as $|x| \to \infty$. Indeed,
when $\mathcal{K}$ is the Newtonian potential, there exists at least one self-similar spreading solution to (2)
when m = 2 — 2/d [TUl El [J3]. In the presence of linear diffusion, these are additionally known 
to be the global attractors [10], [9]. Theorem 3 below extends Theorem 2 to the general case of
$\mathcal{K} \notin W^{1,1}(\mathbb{R}^d)$, where the decay of $\mathcal{K}$ is characterized by $\gamma \in [d-1, d]$ such that $|\nabla\mathcal{K}(x)| = \mathcal{O}(|x|^{-\gamma})$
as $|x| \to \infty$. We show that if $\gamma > d - 1$, then dissipating solutions converge to the self-similar
spreading solutions of (3). However, in contrast to Theorem 2, the long-range effects appear to
degrade the convergence rate and Theorem 3 provides a quantitative estimate of this effect in terms
of $m$ and $\gamma$. It is not known whether the rates obtained in Theorem 3 are sharp. When $\gamma = d - 1$,
the kernel behaves like the Newtonian potential on large length-scales, and the result is no longer
expected to hold if $m = 2 - 2/d$. Indeed, we expect solutions to converge to the self-similar solutions
of ([!]) constructed in [10} IB]. However, Theorem [3] asserts that in supercritical cases, self-similar 
solutions to (3) again govern the intermediate asymptotics. Thus, Theorem 3 provides intermediate
asymptotics for Patlak-Keller-Segel models with linear diffusion in dimensions $d \geq 3$.

As remarked above, the results of Theorems 2 and 3 hold because for small data, the diffusion
dominates the global dynamics. On the other hand, for subcritical problems, the aggregation can
dominate at large length-scales, leading to the existence of nontrivial stationary solutions [33], [3],
which clearly violate strong decay estimates such as those stated below in Theorem 1. However,
between the results here and the work of |33[[3]. not every case is covered. For instance, if K, € 
and 2 — 2/d < m < 2, stationary solutions are only known to exist for sufficiently large mass, and 
the behavior of smaller solutions is unknown. Moreover, convergence to these stationary solutions
is only known in certain cases [30].

In what follows, we denote $\|u\|_p := \|u\|_{L^p(\mathbb{R}^d)}$ where $L^p(\mathbb{R}^d) := L^p$ is the standard Lebesgue
space. We will often suppress the dependencies of functions on space and/or time to enhance
readability. The standard characteristic function for some $S \subset \mathbb{R}^d$ is denoted $\mathbf{1}_S$ and we denote the
ball $B_R(x_0) := \{x \in \mathbb{R}^d : |x - x_0| < R\}$. In formulas we use the notation $C(p, k, M, \ldots)$ to denote a
generic constant, which may be different from line to line or term to term in the same formula. In
general, these constants will depend on more parameters than those listed, for instance those which
are fixed by the problem, such as $\mathcal{K}$ and the dimension, but these dependencies are suppressed.
We use the notation $f \lesssim_{p,k,\ldots} g$ to denote $f \leq C(p, k, \ldots) g$ where again, dependencies that are not
relevant are suppressed.

1.1 Statement of Results 

We need the following definition from [5], which we restate here. 

Definition 1 (Admissible Kernel). We say a kernel $\mathcal{K} \in C^3 \setminus \{0\}$ is admissible if $\mathcal{K} \in W^{1,1}_{loc}(\mathbb{R}^d)$
and the following holds:

(KN) $\mathcal{K}$ is radially symmetric, $\mathcal{K}(x) = k(|x|)$ and is non-increasing.

(MN) $k''(r)$ and $k'(r)/r$ are monotone on $r \in (0, \delta)$ for some $\delta > 0$.

(BD) $|D^3\mathcal{K}(x)| \lesssim |x|^{-d-1}$.

The definition ensures that the kernel is radially symmetric, attractive, reasonably well-behaved
at the origin and has second derivatives which define bounded convolution operators on $L^p$ for
$1 < p < \infty$. It is important to note that all admissible kernels satisfy $\nabla\mathcal{K} \in L^{d/(d-1),\infty}$, where $L^{p,\infty}$ denotes
the weak-$L^p$ space, making the Newtonian potential effectively the most singular of admissible
kernels [5]. Provided $\mathcal{K}$ is admissible, for a given initial condition $u_0(x) \in L^1_+(\mathbb{R}^d; (1 + |x|^2)dx) \cap L^\infty(\mathbb{R}^d)$,
(2) has a unique, local-in-time weak solution which satisfies
$u(t) \in C([0, T); L^1_+(\mathbb{R}^d; (1 + |x|^2)dx)) \cap L^\infty((0,T) \times \mathbb{R}^d)$ [3 El [TUl SZl S]. Moreover, $u(t)$ is a solution to (2) in a sense which is
stronger than a distribution solution, which is important for obtaining a well-behaved local theory, 
but the distinction will not be important here 0E]- Weak solutions conserve mass and we define 
$M = \|u_0\|_1 = \|u(t)\|_1$.
The self-similar solutions to the diffusion equation (3) are well-known, see for instance [51] and
[19]. In the linear case $m = 1$, the self-similar solution is simply the heat kernel,

$$U(t, x; M) = \frac{M}{(4\pi t)^{d/2}} e^{-\frac{|x|^2}{4t}}. \tag{4}$$

In the case of degenerate diffusion $m > 1$, the self-similar solution is given by the Barenblatt
solution,

$$U(t, x; M) = t^{-\beta d}\left(C_1 - \frac{(m-1)\beta}{2m}\frac{|x|^2}{t^{2\beta}}\right)_+^{\frac{1}{m-1}}, \tag{5}$$

where $C_1$ is determined from the conservation of mass and

$$\beta = \frac{1}{d(m-1) + 2}. \tag{6}$$

Note that $\|U(t; M)\|_p \approx t^{-d\beta(1 - 1/p)}$,
and so to provide a meaningful characterization of the convergence to self-similarity, quantitative
estimates will be stated in terms of this relative scale.
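
As a rough numerical illustration of the self-similar scaling, the following Python sketch evaluates a Barenblatt profile of the form $t^{-\beta d}(C_1 - \frac{(m-1)\beta}{2m}|x|^2 t^{-2\beta})_+^{1/(m-1)}$ with $\beta = 1/(d(m-1)+2)$, and checks that its mass does not depend on $t$. The function names, the choice of fixing $C_1$ directly (rather than solving for it from a prescribed mass $M$), and the midpoint quadrature are our own assumptions made for illustration; the sketch requires $m > 1$.

```python
import math


def beta(m: float, d: int) -> float:
    """Similarity exponent beta = 1 / (d (m - 1) + 2)."""
    return 1.0 / (d * (m - 1.0) + 2.0)


def barenblatt(t: float, r: float, m: float, d: int, c1: float) -> float:
    """Radial Barenblatt profile at time t > 0 and radius r >= 0 (m > 1)."""
    b = beta(m, d)
    k = (m - 1.0) * b / (2.0 * m)
    core = c1 - k * r * r / t ** (2.0 * b)
    return t ** (-b * d) * max(core, 0.0) ** (1.0 / (m - 1.0))


def sphere_area(d: int) -> float:
    """Surface area of the unit sphere in R^d."""
    return 2.0 * math.pi ** (d / 2.0) / math.gamma(d / 2.0)


def mass(t: float, m: float, d: int, c1: float, n: int = 20000) -> float:
    """Mass of the profile at time t via a midpoint rule in the radial variable."""
    b = beta(m, d)
    k = (m - 1.0) * b / (2.0 * m)
    radius = math.sqrt(c1 / k) * t ** b  # edge of the support at time t
    h = radius / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        total += barenblatt(t, r, m, d, c1) * r ** (d - 1) * h
    return sphere_area(d) * total
```

In this sketch the value at the origin decays like $t^{-\beta d}$ while the support radius grows like $t^{\beta}$, which is the scale against which the convergence estimates are normalized.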

The entropy-entropy dissipation methods of [181 EH US IS] were used to determine the optimal 
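rate of convergence in $L^1(\mathbb{R}^d)$ to self-similarity. That is, any solution $u(t)$ of (3) satisfies

$$t^{d\beta\left(1 - \frac{1}{p}\right)}\|u(t) - U(t; M)\|_p \lesssim (1 + t)^{-\frac{\beta}{p}\min\left(1, \frac{2}{m}\right)}, \quad \forall p, \ 1 \leq p < \infty.$$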

This rate should be contrasted with the rates obtained in Theorems 2 and 3, where it is shown that
kernels with finite length-scales do not have much effect on the rate, but strong nonlocal effects do.

In order to emphasize the relationship between decay estimates and intermediate asymptotics,
we state them separately. Results similar to (i) of Theorem 1 have been obtained in a variety of
places, for example [HJ|36j[9]. Our estimates are obtained in a closely related but different way than
existing work. We first rescale into the self-similar variables of the diffusion equation as in [9], and 
then adapt the Alikakos [1] iteration techniques of [31] US El E] , which are variants of fairly standard 
methods for obtaining uniform in time $L^\infty$ bounds for PKS models (see also [26, 45, 46, 47]). This
approach to decay estimates has the advantage of naturally extending the existing methods used 
to obtain uniform bounds, and for a relatively mild increase in complexity, much stronger results 
are obtained. Here we use this advantage to also deduce a sufficient condition for decay estimates
to hold in the critical case $m = 2 - 2/d$, (ii) of Theorem 1 below. For critical problems, uniform
equi-integrability in time is known to be equivalent to global uniform boundedness for solutions to (2)
[12, 8, 5], and due to the similarities in the proof, we may state something analogous for decay
estimates. Indeed, (8) is simply the requirement that the solution of the rescaled system remain
uniformly equi-integrable. The proofs of Theorems 1, 2 and 3 are outlined in more detail in §1.2.
Remarks on the limitations and possible extensions are made after the statements.

Theorem 1 (Decay Estimates). Let $d \geq 2$, $m \in [1, 2 - 2/d]$ and $\mathcal{K}$ admissible. Let
$u_0 \in L^1_+(\mathbb{R}^d; (1 + |x|^2)dx) \cap L^\infty(\mathbb{R}^d)$.

(i) There exists an $\epsilon_0 > 0$ (independent of $u_0$) such that if $\|u_0\|_1 + \|u_0\|_{(2-m)d/2} < \epsilon_0$, then the
weak solution $u(t)$ to (2) which satisfies $u(0) = u_0$ is global and satisfies the decay estimate

$$\|u(t)\|_\infty \lesssim (1 + t)^{-d\beta}. \tag{7}$$
(ii) If $m = 2 - 2/d$ and $u(t)$ is a global weak solution to (2) which satisfies

$$\lim_{k \to \infty} \sup_{t \in [0,\infty)} \int \left(u(t,x) - k\left(\frac{t}{\beta} + 1\right)^{-d\beta}\right)_+ dx = 0, \tag{8}$$

then $u(t)$ satisfies the decay estimate (7).



Once the decay estimate (7) has been established, entropy-entropy dissipation methods can be
adapted to deduce the following intermediate asymptotics theorems, as the decay estimate provides 
sufficient control of the nonlocal terms. 

Theorem 2 (Intermediate Asymptotics I: Finite Length-Scale). Let $d \geq 2$, $m \in [1, 2 - 2/d]$ and
$\mathcal{K} \in W^{1,1}(\mathbb{R}^d)$ be admissible. Suppose $u(t)$ is a global weak solution of (2) which satisfies the decay
estimate (7). If $m < 2 - 2/d$, then $u(t)$ satisfies

$$t^{d\beta\left(1 - \frac{1}{p}\right)}\|u(t) - U(t; M)\|_p \lesssim (1 + t)^{-\frac{\beta}{p}}, \quad \forall p, \ 1 \leq p < \infty, \tag{9}$$

and if $m = 2 - 2/d$, then for all $\delta > 0$, $u(t)$ satisfies

$$t^{d\beta\left(1 - \frac{1}{p}\right)}\|u(t) - U(t; M)\|_p \lesssim_\delta (1 + t)^{-\frac{\beta}{p}(1 - \delta)}, \quad \forall p, \ 1 \leq p < \infty. \tag{10}$$

Here $\beta$ is defined in (6) and $U(t, x; M)$ is the self-similar solution to (3) with mass $M = \|u_0\|_1$
given in (4) or (5).

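Theorem 3 (Intermediate Asymptotics II: Infinite Length-Scales). Let $d \geq 2$ and $\mathcal{K}$ be admissible
with $\nabla\mathcal{K}(x) = \mathcal{O}(|x|^{-\gamma})$ as $|x| \to \infty$ for some $\gamma \in [d-1, d]$. If $\gamma = d - 1$ then suppose $m \in [1, 2 - 2/d)$
and otherwise we may take $m \in [1, 2 - 2/d]$. Suppose $u(t)$ is a global weak solution of (2) which
satisfies the decay estimate (7). Then, for all $\delta > 0$, $u(t)$ satisfies

$$t^{d\beta\left(1 - \frac{1}{p}\right)}\|u(t) - U(t; M)\|_p \lesssim_\delta (1 + t)^{-\frac{\beta}{p}\min\left(1, \, 1 + \gamma - \beta^{-1} - \delta\right)}, \quad \forall p, \ 1 \leq p < \infty. \tag{11}$$

Here $\beta$ and $U(t, x; M)$ are as above.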

Remark 1. Note that $u_0 \in L^1_+(\mathbb{R}^d; (1 + |x|^2)dx) \cap L^\infty(\mathbb{R}^d)$ implies $u_0 \log u_0 \in L^1(\mathbb{R}^d)$ by Jensen's
inequality.

Remark 2. The convergence rate in (9) is optimal, as it matches that of the corresponding diffusion
equation. Optimality is not known for (10) or (11), however we suspect that these rates are nearly
optimal. Note that the convergence rate obtained in (11) reduces to (9) and (10) when $\gamma = d$.
Moreover, if $\gamma = d - 1$, then the convergence rate goes to zero as $m \nearrow 2 - 2/d$.
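
The dependence of the rate on $m$ and $\gamma$ can be explored numerically. The sketch below assumes the $L^p$ rate exponent has the form $\frac{\beta}{p}\min(1, 1 + \gamma - \beta^{-1} - \delta)$ with $\beta = 1/(d(m-1)+2)$, which is our reading of the rate in Theorem 3; the function name `rate_exponent` and its argument checks are hypothetical and only meant to illustrate the limiting cases mentioned in Remark 2.

```python
def rate_exponent(m: float, d: int, gamma: float,
                  p: float = 1.0, delta: float = 0.0) -> float:
    """Exponent r such that the relative L^p error is bounded by (1 + t)^(-r).

    Assumed form (our reading of Theorem 3):
        r = (beta / p) * min(1, 1 + gamma - 1/beta - delta).
    """
    if not (d - 1 <= gamma <= d):
        raise ValueError("gamma is assumed to lie in [d - 1, d]")
    if p < 1:
        raise ValueError("p is assumed to satisfy p >= 1")
    b = 1.0 / (d * (m - 1.0) + 2.0)
    return (b / p) * min(1.0, 1.0 + gamma - 1.0 / b - delta)
```

With $\gamma = d$ and $m < 2 - 2/d$ the exponent equals $\beta/p$, as in the finite length-scale case, while for $\gamma = d - 1$ it tends to zero as $m$ approaches $2 - 2/d$.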

Remark 3. In some cases, it may be possible to show that the size conditions of (i) of Theorem 1
are only required of the $L^{(2-m)d/2}$ norm, which is the norm left invariant by the scaling symmetry
of (2) when $\mathcal{K}$ is the Newtonian potential. Indeed, one can prove global existence and uniform
boundedness of solutions to (2) in these supercritical regimes using iteration techniques without
any smallness assumption on the mass [20, 45, 46]. However, in [33] it is shown that depending on
the singularity of the kernel, if the mass is sufficiently large, there may exist stationary solutions 
for all m, 1 < m < 2. Since these are potentially included in our analysis, in order to state decay 
results in the generality we have chosen to, a smallness condition on the mass seems to be necessary. 

Remark 4. The results of [5] suggest that if the kernel $\mathcal{K}$ is less singular than the Newtonian potential
at the origin, the $L^{(2-m)d/2}$ norm could, in some cases, possibly be replaced by a weaker one.
Remark 5. In the critical case $m = 2 - 2/d$, in general $\epsilon_0$ will be strictly less than the critical mass
[j~m [3 [5] . As shown in [121 QUI E E] , equi-integrability conditions such as (jSJ) allow one to use the 
dissipated free energy to deduce global control up to the sharp critical mass. If $\mathcal{K}$ is the Newtonian
potential and $d \geq 3$, then the energy dissipation inequality is sufficient to imply (8) and therefore
solutions satisfy (7) all the way up to the critical mass [8]. Unfortunately, the energy dissipation
inequality does not always seem to imply (8) in general.

Remark 6. We consider only the case of power-law diffusion; however, the estimate (7) holds for
(1) provided $A'(z) \geq c z^{m-1}$ for some $c > 0$. Therefore, it is likely possible to apply the methods of
[14] to this more general case under some structural assumptions on $A$.



1.2 Outline of Proof 

The proof of Theorems 2 and 3 involves several steps. As mentioned above, we use the entropy-entropy
dissipation methods of [18, 19, 15] and in particular, the time-dependent rescaling used in
[19]. All of the computations will be formal; they can be made rigorous for weak solutions either
with a suitable parabolic regularization and passing to the limit, as in for instance [6l [151 SI) or 
presumably also lifting to strictly positive solutions, as is common in the study of the porous media 
equation [51]. 

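Following [19], we define $\theta(\tau, \eta)$ such that

$$e^{-d\tau}\theta(\tau, \eta) = u(t, x), \tag{12}$$

with coordinates $e^\tau \eta = x$ and $\beta e^{\tau/\beta} - \beta = t$, where $\beta$ is given by (6). In what follows we denote
$\alpha := d\beta$. In these coordinates, if $u(t,x)$ solves (2) then $\theta(\tau, \eta)$ solves,

$$\partial_\tau\theta = \nabla \cdot (\eta\theta) + \Delta\theta^m - e^{(1 - \alpha - \beta)\beta^{-1}\tau}\nabla \cdot \left(\theta \left(e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right)\right). \tag{13}$$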
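
The change of variables can be written out explicitly. The following Python sketch implements the maps $(t, x) \mapsto (\tau, \eta)$ and back, using $e^\tau\eta = x$ and $\beta e^{\tau/\beta} - \beta = t$, so that $\tau = \beta\log((t + \beta)/\beta)$. The function names are our own and purely illustrative; points of $\mathbb{R}^d$ are represented as plain lists.

```python
import math


def beta(m: float, d: int) -> float:
    """Similarity exponent beta = 1 / (d (m - 1) + 2)."""
    return 1.0 / (d * (m - 1.0) + 2.0)


def to_self_similar(t: float, x: list, m: float, d: int):
    """Map (t, x) to (tau, eta) with e^tau * eta = x, beta e^(tau/beta) - beta = t."""
    b = beta(m, d)
    tau = b * math.log((t + b) / b)
    eta = [xi * math.exp(-tau) for xi in x]
    return tau, eta


def from_self_similar(tau: float, eta: list, m: float, d: int):
    """Inverse map from (tau, eta) back to (t, x)."""
    b = beta(m, d)
    t = b * math.exp(tau / b) - b
    x = [ei * math.exp(tau) for ei in eta]
    return t, x


def rescale_density(u_value: float, tau: float, d: int) -> float:
    """theta(tau, eta) = e^(d tau) u(t, x), which preserves the total mass."""
    return math.exp(d * tau) * u_value
```

Note that $t = 0$ corresponds to $\tau = 0$ and $\eta = x$, so the initial data is unchanged by the rescaling.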

Moreover, $U(t, x; M)$ is stationary in these coordinates, and will be denoted by $\theta_M(\eta)$. That is (see

-13 




In fact, $\theta_M(\eta)$ is the unique non-negative solution with mass $M$ to the (degenerate, if $m > 1$) elliptic
equation

$$0 = \nabla \cdot (\eta\theta) + \Delta\theta^m. \tag{15}$$

In what follows we will refer to $\theta_M$ as the ground state Barenblatt solution. Clearly, ground state
solutions are stationary solutions of the homogeneous Fokker-Planck equation

$$\partial_\tau\theta = \nabla \cdot (\eta\theta) + \Delta\theta^m. \tag{16}$$

Therefore, the asymptotic convergence to self-similar profiles of solutions to (3) is equivalent to the
convergence to the stationary profiles of (16). This was the fundamental observation made in [19]
and is the purpose of the rescaling (12).

A primary step to proving Theorems 2 and 3 is establishing that $\theta(\tau, \eta) \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$. By
the change of variables, this is precisely the decay estimate (7) stated in Theorem 1. This estimate
is what allows us to treat the inhomogeneous non-local term in (13) as a vanishing perturbation
of (16). The decay estimate $\|u(t)\|_\infty \lesssim t^{-d\beta}$, or equivalently, $\|\theta(\tau)\|_\infty \lesssim 1$, is easily obtained for
(3) in the linear case and the classical Aronson-Bénilan estimate proves it in the case $m > 1$ [51].
Clearly, no such analogues are available for (13), and they will instead be provided by Theorem 1.
To prove Theorem 1 we adapt the Alikakos iterations of [31, 12, 8, 5] to (13) to prove a uniform
bound in the rescaled variables. Dealing with time-dependent rescalings in (13) introduces several
complications. Obtaining $L^p$ estimates for the critical case $m = 2 - 2/d$ is relatively straightforward
due to the inherent scale-invariance of the relevant inequalities. In the supercritical case $m < 2 - 2/d$,
the effect of the time-dependent rescaling in (13) is crucial for closing a key bootstrap/continuity
argument necessary to control the solution uniformly in time. It is at this step that the method
below diverges significantly from existing methods and is a key step to obtaining Theorem 1. The
additional issue when $\nabla\mathcal{K} \notin L^1$ is in obtaining $L^\infty$ estimates, which requires measuring the rate at
which $e^{d\tau}\nabla\mathcal{K}(e^\tau\eta)$ blows up in $L^1_{loc}$ as $\tau \to \infty$. This also arises later when we estimate how much
the nonlocal term affects the entropy dissipation and is the source of the degraded convergence rates
found in Theorem 3.

Once we have established $\theta(\tau, \eta) \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$, we prove that solutions to (13) converge to
$\theta_M$ and estimate the convergence rate in $L^1$. In fact, these are done together, as the quantitative
estimate is direct and removes the need for compactness arguments. The primary step of the entropy
dissipation method is an estimate of the decay of the entropy associated to (16). In the case $m = 1$,
the entropy is given by,

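$$H(\theta) = \int \theta\log\theta \, d\eta + \frac{1}{2}\int |\eta|^2\theta \, d\eta, \tag{17}$$

and the entropy production functional by

$$I(\theta) = \int \theta\left|\nabla\log\theta + \eta\right|^2 d\eta. \tag{18}$$

In the nonlinear case $m > 1$, the corresponding quantities are,

$$H(\theta) = \frac{1}{m-1}\int \theta^m \, d\eta + \frac{1}{2}\int |\eta|^2\theta \, d\eta, \tag{19}$$

and the entropy production functional,

$$I(\theta) = \int \theta\left|\frac{m}{m-1}\nabla\theta^{m-1} + \eta\right|^2 d\eta. \tag{20}$$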



In the nonlinear case, these entropies were originally introduced for studying (16) in [38, 43]. Both
(17) and (19) are displacement convex [36] and in fact, (16) is a gradient flow for (19) or (17) in the
Euclidean Wasserstein distance [33 [2j [TBI E], and if $f(\tau, \eta)$ solves (16), then

$$\frac{d}{d\tau}H(f(\tau)) = -I(f(\tau)).$$

For a given mass $M$, (19) has a unique non-negative minimizer which is the ground state $\theta_M$. That
is, if we define the relative entropy

$$H(\theta|\theta_M) = H(\theta) - H(\theta_M), \tag{21}$$

then $H(\theta|\theta_M) \geq 0$ with equality if and only if $\theta = \theta_M$ [19, 42]. In order to estimate a convergence
rate, it is therefore sensible to measure how quickly $H(\theta|\theta_M) \to 0$. Following the methods of
[19, 15, 18, 17], this is made possible by the following two theorems. The first relates the entropy
production functional (20) to the relative entropy (21). This represents a generalization of the Gross
logarithmic Sobolev inequality [22] (see also [42]).
Theorem 4 (Generalized Gross Logarithmic Sobolev Inequality [191 13 H21 [22]). Let $f \in L^1_+(\mathbb{R}^d)$
with $\|f\|_1 = M$ and let $\theta_M$ be the ground state Barenblatt solution with mass $M$. Then,

$$H(f|\theta_M) \leq \frac{1}{2}I(f). \tag{22}$$

For the Fokker-Planck equation (16), Theorem 4 implies $H(\theta(\tau)|\theta_M) \lesssim e^{-2\tau}$. The (generalized)
Csiszár-Kullback inequality [21, 32] relates the relative entropy to the $L^1$ norm.

Theorem 5 (Csiszár-Kullback Inequality [15]). Let $f \in L^1_+(\mathbb{R}^d)$ with $\|f\|_1 = M$ and let $\theta_M$ be the
ground state Barenblatt solution with mass $M$. Then,

$$\|f - \theta_M\|_1 \lesssim H(f|\theta_M)^{\min\left(\frac{1}{2}, \frac{1}{m}\right)}. \tag{23}$$

Note that since we are interested in $1 \leq m \leq 2 - 2/d$, we will only apply the inequality with
exponent $1/2$.

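To prove Theorems 2 and 3, the purpose of proving $\theta(\tau, \eta) \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$ is to control the
growth of $\|e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\|_\infty$, which depends on the long-range effects of the kernel. Ultimately,
this provides a bound essentially of the form,

$$\frac{d}{d\tau}H(\theta(\tau)) \leq -I(\theta(\tau)) + C(M, \|\theta\|_{L^\infty(\mathbb{R}^+ \times \mathbb{R}^d)})e^{-\gamma\tau},$$

for some $\gamma > 0$ (in reality, it is not quite as clean). Theorem 4 then implies,

$$\frac{d}{d\tau}H(\theta(\tau)|\theta_M) \leq -2H(\theta(\tau)|\theta_M) + C(M, \|\theta\|_{L^\infty(\mathbb{R}^+ \times \mathbb{R}^d)})e^{-\gamma\tau}.$$

Integrating this and applying Theorem 5 implies,

$$\|\theta(\tau) - \theta_M\|_1 \lesssim e^{-\frac{\tau}{2}\min(2, \gamma)},$$

which after rescaling and interpolation against the decay estimate (7), will prove Theorems 2 and 3.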
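
The integration step can be made concrete with a small Python sketch. It evaluates the exact solution of the model equation $y' = -2y + Ce^{-\gamma\tau}$, $y(0) = H_0$, which dominates any solution of the corresponding differential inequality, and reads off the decay exponent. The function names are hypothetical, and the constants $H_0$, $C$ are placeholders rather than quantities computed from an actual solution.

```python
import math


def entropy_bound(tau: float, h0: float, c: float, gamma: float) -> float:
    """Exact solution of y' = -2 y + c exp(-gamma tau) with y(0) = h0."""
    if abs(gamma - 2.0) < 1e-12:
        # Resonant case: a linear factor in tau appears.
        return math.exp(-2.0 * tau) * (h0 + c * tau)
    return (math.exp(-2.0 * tau) * h0
            + c * (math.exp(-gamma * tau) - math.exp(-2.0 * tau)) / (2.0 - gamma))


def l1_decay_exponent(gamma: float) -> float:
    """Exponent of e^(-tau) for the L^1 distance, using exponent 1/2 in Theorem 5."""
    return 0.5 * min(2.0, gamma)
```

For $\gamma \neq 2$ the solution is bounded by $(H_0 + C/|2 - \gamma|)e^{-\min(2,\gamma)\tau}$, and taking the square root as in Theorem 5 halves the exponent.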

2 Preliminary Decay Estimates 

Let $q = (2 - m)d/2$ and let $\eta$, $\tau$ and $\theta(\tau, \eta)$ be as defined in §1.2. As detailed above, we establish
that $\theta(\tau, \eta) \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$ using Alikakos iteration [1] (see also [261 ED E US El HH]). The
first step is to prove the following lemma which allows control over $L^p$ norms with $p < \infty$. In what
follows we denote $\theta_0(\eta) := \theta(0, \eta) = u(0, x)$.

Lemma 1 (Control for $L^p$, $p < \infty$ for small data). For all $q < p < \infty$, there exists $C_q = C_q(p, M)$
and $C_M = C_M(p, \|\theta_0\|_q)$ such that if $\|\theta_0\|_q < C_q$ and $M < C_M$, then $\|\theta(\tau)\|_p \in L^\infty_\tau(\mathbb{R}^+)$.

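Proof. Define

$$\mathcal{I} := \int \theta^{m-1}\left|\nabla\theta^{p/2}\right|^2 d\eta.$$

We estimate the time evolution of $\|\theta\|_p$ using integration by parts, Hölder's inequality and Lemma
6 in the appendix,

$$\begin{aligned} \frac{d}{d\tau}\|\theta\|_p^p &= -\frac{4m(p-1)}{p}\mathcal{I} - (p-1)e^{(1-\alpha-\beta)\beta^{-1}\tau}\int \theta^p \, \nabla \cdot \left(e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right) d\eta + d(p-1)\|\theta\|_p^p \\ &\leq -C(p)\mathcal{I} + C(p)e^{(1-\alpha-\beta)\beta^{-1}\tau}\|\theta\|_{p+1}^p\left\|\nabla\left(e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right)\right\|_{p+1} + C(p)\|\theta\|_p^p \\ &\leq -C(p)\mathcal{I} + C(p)e^{(1-\alpha)\beta^{-1}\tau}\|\theta\|_{p+1}^{p+1} + C(p)\|\theta\|_p^p. \end{aligned} \tag{24}$$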
We bound the second term using the homogeneous Gagliardo-Nirenberg-Sobolev inequality
(Lemma 5 in the appendix),

$$\|\theta\|_{p+1}^{p+1} \lesssim \|\theta\|_q^{\alpha_2(p+1)}\mathcal{I}^{\alpha_1(p+1)/2}, \tag{25}$$

where $\alpha_2 = 1 - \alpha_1(p + m - 1)/2$ and

$$\alpha_1 = \frac{2d(q - p - 1)}{(p + 1)\left(q(d - 2) - d(p + m - 1)\right)}.$$

By the definition of $q$ we have that,

$$\frac{\alpha_1(p + 1)}{2} = \frac{d(q - p - 1)}{q(d - 2) - d(p + m - 1)} = 1.$$

We also estimate the last term in (24) using Lemma 5,

$$\|\theta\|_p^p \lesssim M^{\beta_2 p}\mathcal{I}^{\beta_1 p/2}, \tag{26}$$

where $\beta_2 = 1 - \beta_1(p + m - 1)/2$ and,

$$\frac{\beta_1 p}{2} = \frac{d(p - 1)}{2 - d + d(p + m - 1)} < 1,$$

by $1 - 2/d < m$. Then applying weighted Young's inequality we have from (25), (26) and (24),

$$\frac{d}{d\tau}\|\theta\|_p^p \leq \left(C_1(p)e^{(1-\alpha)\beta^{-1}\tau}\|\theta\|_q^{\alpha_2(p+1)} - C_2(p)\right)\mathcal{I} + C_3(p)M^{\gamma(p)}, \tag{27}$$

for $\gamma(p) = 2\beta_2 p/(2 - \beta_1 p) > 0$. If $m = 2 - 2/d$, then $q = 1$ and $1 - \alpha = 0$, therefore by conservation
of mass it is possible to choose $M$ sufficiently small such that the first term in (27) is less than $-\delta\mathcal{I}$
for some $\delta > 0$. If $m < 2 - 2/d$, then $q > 1$ and $\|\theta\|_q$ is no longer conserved. Here we must take
advantage of $1 - \alpha < 0$. Note that (27) holds for $p = q$; therefore since $1 - \alpha < 0$, a continuity
argument establishes that for $\|\theta_0\|_q$ and $M$ sufficiently small,

$$\|\theta(\tau)\|_q^q \leq \|\theta_0\|_q^q + C_3(q)M^{\gamma(q)}\tau.$$

Indeed, for $\|\theta_0\|_q$ small, this holds for at least some time, and for $M$ sufficiently small, this linear
growth is such that the first term in (27) remains non-positive forever. Then by (27) for $p > q$, if
$M$ and $\|\theta_0\|_q$ additionally satisfy

$$C_1(p)e^{(1-\alpha)\beta^{-1}\tau}\left(C_3(q)M^{\gamma(q)}\tau + \|\theta_0\|_q^q\right)^{\alpha_2(p+1)/q} - C_2(p) < -\delta,$$

for all $\tau > 0$, then the first term is less than $-\delta\mathcal{I}$. By $1 - \alpha < 0$ we may always choose $M$ and $\|\theta_0\|_q$
such that this is possible. Therefore, whether $q > 1$ or $q = 1$, for small initial data in the suitable
sense, we have

$$\frac{d}{d\tau}\|\theta\|_p^p \leq -\delta\mathcal{I} + C(M, p).$$

Using (26) and Young's inequality for products, we have a lower bound on $\mathcal{I}$,

$$\|\theta\|_p^p - C(M) \lesssim \mathcal{I}.$$

This proves,

$$\frac{d}{d\tau}\|\theta\|_p^p \leq -\delta\|\theta\|_p^p + C(M, p),$$

which immediately concludes the lemma with $\|\theta\|_p^p \leq \max\left(\|\theta_0\|_p^p, C(M, p)\delta^{-1}\right)$. □
We now turn to proving that (8) implies something analogous to Lemma 1. Let $u(t)$ be as in
(ii) of Theorem 1. One can verify that (8) is equivalent to

$$\lim_{k \to \infty} \sup_{\tau \in [0,\infty)} \|(\theta(\tau) - k)_+\|_1 = 0, \tag{28}$$

which is precisely the condition of uniform equi-integrability which plays a key role in [12, 8, 5]. We
may refine Lemma 1 in the following fashion, adapting the techniques in [12], [8], [5] to this setting.

Lemma 2 (Control for $L^p$, $p < \infty$ for equi-integrable solutions). If $\theta(\tau)$ satisfies (28) then we have
$\|\theta(\tau)\|_p \in L^\infty_\tau(\mathbb{R}^+)$ for all $p < \infty$.

Proof. We proceed similarly to the proof of Lemma 1, but now slightly refined to take advantage
of (28). Since similar arguments have appeared in several locations (for example [12], [8], [5]) we
sketch a proof and highlight mainly the differences that appear due to the rescaling in (13). Define
$\theta_k(\tau, \eta) := (\theta(\tau, \eta) - k)_+$ and



gm-1 
7 fc 



The $L^p$ norms of $\theta$ and $\theta_k$ are related through the following inequality for $1 < p < \infty$,

$$\|\theta\|_p^p \lesssim \|\theta_k\|_p^p + k^{p-1}\|\theta\|_1. \tag{29}$$

It is important to note that the implicit constant in (29) does not depend on $k$. Estimating the
time evolution of $\theta_k$ as in Lemma 1 using Lemma 6 and (29) implies,



-C{p)X -J ({p- l)9 p k + ^C 1 ) V • (e dT S/fC(e T -) * 9 k ) dn 
<-C(p)l + C( P )\\9 k \\lll + C(p,k)\\9 k \\P + C(k,p,M). 
Using the Gagliardo-Nirenberg-Sobolev inequality (Lemma [5]) implies, 

^\\9 k \\ p p < -^^\\e k \\ p P tl + c(p)pk\Zt\ + c ^ V\W + c ^ M )' 

II h 1 1 1 

where Q2 = 1 — &\(p + m — l)/2 > and 

2d(l - l/(p + 1)) 



2-d + dp + d(m - 1) ' 
Note ||0fc|| p < M 1 ^ 2 ||0fc||i+i 1 ' >/ ' P , which by weighted Young's inequality implies, 

^\M P P < -^|plNlSi + C(p)II^IISi + C(fc,p, M). 

Using (28) we may make the leading order terms as negative as we want and interpolating $L^p$ against
$L^1$ and $L^{p+1}$ again implies there is a $\delta > 0$ such that if $k$ is sufficiently large we have,

$$\frac{d}{d\tau}\|\theta_k\|_p^p \leq -\delta\|\theta_k\|_p^p + C(k, p, M).$$

By (29) and conservation of mass, this concludes the proof of Lemma 2. □
3 Finite Length-Scales

We begin by proving Theorem 1 for the case $\nabla\mathcal{K} \in L^1$. Alikakos iteration [1] is a standard method
for using a result such as Lemma 1 to imply a result of the following form.

Lemma 3 (Control of $L^\infty$ for small data). Let $\nabla\mathcal{K} \in L^1$. Then there exists $C_q = C_q(M)$ and
$C_M = C_M(\|\theta_0\|_q)$ such that if $\|\theta_0\|_q < C_q$ and $M < C_M$, then $\|\theta(\tau)\|_\infty \in L^\infty_\tau(\mathbb{R}^+)$.

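Proof. Standard iteration implies $\|\theta(\tau)\|_\infty \in L^\infty_\tau(\mathbb{R}^+)$, provided

$$v := e^{(1-\alpha-\beta)\beta^{-1}\tau}e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d).$$

See [261 HI 13 ED S3 US]. For instance, an iteration lemma due to Kowalczyk [31] may be extended
easily to the case $\mathbb{R}^d$, $d \geq 2$ and to include the $\nabla \cdot (\eta\theta)$ term in (13) [12].

Fix $p > d$. Then by Lemma 1, for sufficiently small $M$ and $\|\theta_0\|_q$, $\|\theta(\tau)\|_p \in L^\infty_\tau(\mathbb{R}^+)$. Therefore
by Lemma 6 in the appendix,

$$\|\nabla v\|_p = \left\|e^{(1-\alpha-\beta)\beta^{-1}\tau}\nabla\left(e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right)\right\|_p \lesssim e^{(1-\alpha)\beta^{-1}\tau}\|\theta\|_p \lesssim e^{(1-\alpha)\beta^{-1}\tau}.$$

Moreover, by $\nabla\mathcal{K} \in L^1(\mathbb{R}^d)$,

$$\|v\|_p \lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau}\|\theta\|_p \lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau}.$$

Since $1 - \alpha \leq 0$, Morrey's inequality implies $v \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$ and the lemma follows. □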
By Lemma 3 and the definition of $\tau$,

$$\|u(t)\|_{L^\infty(\mathbb{R}^d)} = e^{-d\tau}\|\theta\|_{L^\infty(\mathbb{R}^d)} \lesssim (1 + t)^{-d\beta},$$

establishing (7). A similar argument using Lemma 2 in place of Lemma 1 implies

Lemma 4. Theorem 1 holds if $\nabla\mathcal{K} \in L^1$.

Now we turn to Theorem 2.

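Proof. (Theorem 2: Intermediate Asymptotics I) Now that the requisite decay estimate has
been established, we proceed by estimating the decay of the relative entropy (21). By Young's
inequality, $\nabla\mathcal{K} \in L^1(\mathbb{R}^d)$ and Lemma 3,

$$\|e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\|_\infty \leq \|\nabla\mathcal{K}\|_1\|\theta\|_\infty \lesssim 1. \tag{30}$$

We first settle the case $m > 1$. By a standard computation, (30) and Cauchy-Schwarz, for all
$\delta > 0$,

$$\begin{aligned} \frac{d}{d\tau}H(\theta(\tau)|\theta_M) &= -I(\theta) + e^{(1-\alpha-\beta)\beta^{-1}\tau}\int \nabla\left(\frac{m}{m-1}\theta^{m-1} + \frac{1}{2}|\eta|^2\right) \cdot \theta \, e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta \, d\eta \\ &\leq -I(\theta) + e^{(1-\alpha-\beta)\beta^{-1}\tau}I(\theta)^{1/2}\left(\int \theta\left|e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right|^2 d\eta\right)^{1/2} \\ &\leq -(1 - e^{-2\delta\tau})I(\theta) + Ce^{(2 - 2\alpha - 2\beta)\beta^{-1}\tau + 2\delta\tau}. \end{aligned}$$

Let $\gamma(\delta) := (2\alpha + 2\beta - 2)\beta^{-1} - 2\delta > 0$. By the generalized Gross Logarithmic Sobolev inequality,
Theorem 4, we therefore have,

$$\frac{d}{d\tau}H(\theta(\tau)|\theta_M) \leq -2(1 - e^{-2\delta\tau})H(\theta|\theta_M) + Ce^{-\gamma(\delta)\tau}. \tag{31}$$

Solving the differential inequality (31) implies,

$$H(\theta(\tau)|\theta_M) \lesssim e^{-\tau\min(2, \gamma(\delta))}.$$

Now by the generalized Csiszár-Kullback inequality, Theorem 5,

$$\|\theta(\tau) - \theta_M\|_1 \lesssim e^{-\frac{\tau}{2}\min(2, \gamma(\delta))}.$$

Re-writing in terms of $x$ and $t$ and using (14),

$$\|u(t) - U(t; M)\|_1 \lesssim (1 + t)^{-\frac{\beta}{2}\min(2, \gamma(\delta))}.$$

If $m < 2 - 2/d$, it can be verified that $\delta > 0$ may always be chosen small enough such that $2 < \gamma(\delta)$.
If instead $m = 2 - 2/d$, then $2d + 2 - 2\beta^{-1} = 2$. This establishes (9) and (10) in the case $p = 1$. Interpolation
against (7) completes the proof.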

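We now settle the case $m = 1$. The time evolution of the relative entropy is similar to above.
By (30) and Cauchy-Schwarz, for all $\delta > 0$,

$$\begin{aligned} \frac{d}{d\tau}H(\theta(\tau)|\theta_M) &= -I(\theta) + e^{(1-\alpha-\beta)\beta^{-1}\tau}\int \nabla\left(\log\theta + \frac{1}{2}|\eta|^2\right) \cdot \theta \, e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta \, d\eta \\ &\leq -I(\theta) + e^{(1-\alpha-\beta)\beta^{-1}\tau}I(\theta)^{1/2}\left(\int \theta\left|e^{d\tau}\nabla\mathcal{K}(e^\tau \cdot) * \theta\right|^2 d\eta\right)^{1/2} \\ &\leq -(1 - e^{-\delta\tau})I(\theta) + Ce^{(2 - 2\alpha - 2\beta)\beta^{-1}\tau + \delta\tau}. \end{aligned}$$

The rest of the proof follows similarly to the case $m > 1$ using Theorems 4 and 5. This concludes
the proof of Theorem 2. □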

Remark 7. A generalization of Talagrand's inequality [16] shows that $\theta \to \theta_M$ also in the Euclidean Wasserstein distance.



4 Infinite Length-Scales 

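We now turn to the proofs of Theorem 1 and Theorem 3 in the case $\nabla\mathcal{K} \notin L^1$. In order to properly extend the work of the previous section, we must estimate the quantities $\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\|_p$ appearing in (30) and the proof of Lemma 3. However, $\nabla\mathcal{K} \notin L^1(\mathbb{R}^d)$ and Young's inequality is not sufficient; in fact we will not bound $\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\|_p$ uniformly in time but instead bound the rate at which it grows. We separately estimate the growth of the quantities $\|\lambda^d\nabla\mathcal{K}(\lambda\cdot)\mathbf{1}_{B_1(0)}\|_1$ and $\|\lambda^d\nabla\mathcal{K}(\lambda\cdot)\mathbf{1}_{\mathbb{R}^d\setminus B_1(0)}\|_p$ as $\lambda \to \infty$. Using $|\nabla\mathcal{K}(x)| \lesssim |x|^{-\gamma}$ for sufficiently large $|x|$, if $\gamma < d$, then for large $\lambda$,

\begin{align*}
\int \lambda^d|\nabla\mathcal{K}(\lambda y)|\mathbf{1}_{B_1(0)}(|y|)\,dy &= \int_{|y|<\lambda}|\nabla\mathcal{K}(y)|\,dy \\
&= \int_{S^{d-1}}\int_0^{\lambda}|\nabla\mathcal{K}(\rho\omega)|\rho^{d-1}\,d\rho\,d\omega \\
&\lesssim 1 + \lambda^{d-\gamma}. \tag{32}
\end{align*}

Similarly, if $\gamma = d$, then for large $\lambda$,

$$\int \lambda^d|\nabla\mathcal{K}(\lambda y)|\mathbf{1}_{B_1(0)}(|y|)\,dy \lesssim 1 + \log\lambda. \tag{33}$$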
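The two growth rates for the mass inside the ball can be illustrated numerically on a model profile. The sketch below is only an illustration under stated assumptions: `profile` is a hypothetical radial stand-in for $|\nabla\mathcal{K}|$ (singular like $\rho^{1-d}$ near the origin, decaying like $\rho^{-\gamma}$ at long range), the surface area of the unit sphere is dropped, and all function names are ours.

```python
import math


def profile(rho, d, gamma):
    # HYPOTHETICAL radial model for |grad K|: rho^(1-d) near the origin,
    # rho^(-gamma) at long range. Not the kernel of any specific model.
    return rho ** (1 - d) if rho <= 1.0 else rho ** (-gamma)


def inner_mass(lam, d, gamma, n=100_000):
    # Midpoint rule for int_0^lam profile(rho) * rho^(d-1) d rho, i.e. the
    # radial part of int_{|y| < lam} |grad K(y)| dy (sphere area omitted).
    h = lam / n
    total = 0.0
    for i in range(n):
        rho = (i + 0.5) * h
        total += profile(rho, d, gamma) * rho ** (d - 1)
    return total * h


def predicted(lam, d, gamma):
    # Closed form of the same integral for lam >= 1: growth like
    # lam^(d-gamma) when gamma < d, and like log(lam) when gamma = d.
    if gamma == d:
        return 1.0 + math.log(lam)
    return 1.0 + (lam ** (d - gamma) - 1.0) / (d - gamma)


for lam in (10.0, 100.0, 1000.0):
    print(lam,
          inner_mass(lam, 3, 2.5), predicted(lam, 3, 2.5),   # gamma < d
          inner_mass(lam, 3, 3.0), predicted(lam, 3, 3.0))   # gamma = d
```

For this model the quadrature and the closed form agree, and the closed form is bounded by a constant multiple of $1 + \lambda^{d-\gamma}$ (respectively $1 + \log\lambda$), matching the shape of the two estimates above.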
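If $d/(d-1) < q < \infty$, since $\gamma > d - 1$, for $\lambda$ sufficiently large we have,

\begin{align*}
\int \lambda^{qd}|\nabla\mathcal{K}(\lambda y)|^q\mathbf{1}_{\mathbb{R}^d\setminus B_1(0)}(|y|)\,dy &= \int_{|y|>\lambda}\lambda^{qd-d}|\nabla\mathcal{K}(y)|^q\,dy \\
&= \lambda^{qd-d}\int_{S^{d-1}}\int_{\lambda}^{\infty}|\nabla\mathcal{K}(\rho\omega)|^q\rho^{d-1}\,d\rho\,d\omega \\
&\lesssim \lambda^{q(d-\gamma)}. \tag{34}
\end{align*}

Similarly,

$$\sup_{|x| \geq 1}\big|\lambda^d\nabla\mathcal{K}(\lambda x)\big| \lesssim 1 + \lambda^{d-\gamma}. \tag{35}$$

We may now complete the general proof of Theorem 1.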



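Proof. (Theorem 1) We first complete the proof of (i). Lemma 3 extends to the case $\nabla\mathcal{K} \notin L^1$ provided we can bound $v := e^{(1-\alpha-\beta)\beta^{-1}\tau}e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta$ in $L^\infty(\mathbb{R}^d)$ uniformly in time. Indeed, fix $p > d$. Then for $M$ and $\|\theta_0\|_{\bar p}$ sufficiently small, we have by Lemma 1 that $\|\theta(\tau)\|_p \in L^\infty_\tau(\mathbb{R}^+)$. By Lemma 6,

$$\|\nabla v\|_p \lesssim e^{(1-\alpha)\beta^{-1}\tau}\|\theta\|_p \lesssim e^{(1-\alpha)\beta^{-1}\tau}.$$

Let $q$ be such that $d/(d-1) < q < p$, which implies $\|\theta(\tau)\|_q \lesssim 1$. If $\gamma < d$ then by Young's inequality,

\begin{align*}
\|v\|_q &\leq e^{(1-\alpha-\beta)\beta^{-1}\tau}\Big(\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{B_1(0)}\ast\theta\|_q + \|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{\mathbb{R}^d\setminus B_1(0)}\ast\theta\|_q\Big) \\
&\leq e^{(1-\alpha-\beta)\beta^{-1}\tau}\Big(\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{B_1(0)}\|_1\|\theta\|_q + \|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{\mathbb{R}^d\setminus B_1(0)}\|_q\|\theta\|_1\Big).
\end{align*}

Since $\|\theta(\tau)\|_q \lesssim 1$, by (32) and (34) we have,

\begin{align*}
\|v\|_q &\lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau}\big(1 + e^{(d-\gamma)\tau}\big) \\
&\lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau} + e^{(1-\beta-\gamma\beta)\beta^{-1}\tau}.
\end{align*}

Since $1 - \beta - \gamma\beta < 0$ and $1 - \alpha \leq 0$, by Morrey's inequality we may conclude $v \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$. Similarly if $\gamma = d$, then by the same reasoning as above, (33) and (34) imply,

\begin{align*}
\|v\|_q &\lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau}\big(1 + \tau + e^{(d-\gamma)\tau}\big) \\
&\lesssim e^{(1-\alpha-\beta)\beta^{-1}\tau}(1 + \tau) + e^{(1-\beta-\gamma\beta)\beta^{-1}\tau}.
\end{align*}

Since $1 - \alpha - \beta < 0$, we may conclude also in this case that $v \in L^\infty_{\tau,\eta}(\mathbb{R}^+ \times \mathbb{R}^d)$. Therefore Lemma 3 applies with the hypotheses of Theorem 3. Re-writing in terms of $x$ and $t$, this implies (7). A similar proof with Lemma 2 in place of Lemma 1 also proves (ii). □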

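We now prove Theorem 3.

Proof. (Theorem 3: Intermediate Asymptotics II) To complete the proof of Theorem 3 we estimate the decay of the relative entropy (21). The proof of Theorem 2 used the estimate (30). Here we use the bound $\|\theta(\tau)\|_\infty \lesssim 1$, (35) and (32) to imply, if $\gamma < d$,

\begin{align}
\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\|_\infty &\leq \Big(\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{B_1(0)}\ast\theta\|_\infty + \|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\mathbf{1}_{\mathbb{R}^d\setminus B_1(0)}\ast\theta\|_\infty\Big) \tag{36} \\
&\lesssim 1 + e^{(d-\gamma)\tau} \lesssim e^{(d-\gamma)\tau}. \tag{37}
\end{align}

Similarly, if $\gamma = d$ then, for all $\delta > 0$,

$$\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\|_\infty \lesssim 1 + \tau \lesssim_\delta e^{\delta\tau}.$$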
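The last inequality in the borderline case $\gamma = d$ trades linear growth for an arbitrarily slow exponential. A one-line justification, with an explicit (non-optimal) constant chosen here purely for illustration:

```latex
\[
1+\tau \;\le\; \max(1,\delta^{-1})\,(1+\delta\tau) \;\le\; \max(1,\delta^{-1})\,e^{\delta\tau},
\qquad \tau \ge 0,\ \delta > 0,
\]
% using 1 + s \le e^{s} with s = \delta\tau.
```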

The growth of (37) in time is the source of the degraded convergence rate observed in (11). As noted above, this is a manifestation of slow decay in the kernel, which causes growth of $e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)$ in $L^1$. Indeed, computing the decay of the relative entropy (with linear or nonlinear diffusion) as above with (37),

\begin{align*}
\frac{d}{d\tau}H(\theta(\tau)|\theta_M) &\leq -I(\theta) + e^{(1-\alpha-\beta)\beta^{-1}\tau} I(\theta)^{1/2}\Big(\int \theta\,\big|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\big|^2 d\eta\Big)^{1/2} \\
&\leq -(1 - e^{-2\delta\tau})I(\theta) + Ce^{(2(1-\alpha-\beta)\beta^{-1} + 2(d-\gamma) + 2\delta)\tau}.
\end{align*}



As before, Theorems 4 and 5 imply,

$$\|\theta(\tau) - \theta_M\|_1 \lesssim e^{-\tau\min(1,\,1+\gamma-\beta^{-1}-\delta)}.$$

Re-writing in terms of $x$ and $t$ and interpolating against (7) completes the proof. The corresponding argument follows also for $\gamma = d$, absorbing the mild growth of $\|e^{d\tau}\nabla\mathcal{K}(e^{\tau}\cdot)\ast\theta\|_\infty$ into the $\delta$ already introduced. □
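To see where the exponent in the final $L^1$ estimate comes from, one can track the forcing term in the entropy inequality. The sketch below assumes the relation $\alpha = d\beta$ (consistent with the identity $(2\alpha+2\beta-2)\beta^{-1} = 2d+2-2\beta^{-1}$ used in the proof of Theorem 2) and writes $\Gamma$ for the resulting decay exponent, a symbol introduced here only for illustration:

```latex
\begin{align*}
-\Gamma &:= 2(1-\alpha-\beta)\beta^{-1} + 2(d-\gamma) + 2\delta
        = -2\big(1 + \gamma - \beta^{-1} - \delta\big), \\
H(\theta(\tau)|\theta_M) &\lesssim e^{-\tau\min(2,\Gamma)}
\quad\Longrightarrow\quad
\|\theta(\tau)-\theta_M\|_1 \lesssim H(\theta(\tau)|\theta_M)^{1/2}
 \lesssim e^{-\tau\min(1,\,1+\gamma-\beta^{-1}-\delta)}.
\end{align*}
```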
6 Appendix

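Lemma 5 (Homogeneous Gagliardo-Nirenberg-Sobolev). Let $d \geq 2$ and $f : \mathbb{R}^d \to \mathbb{R}$ satisfy $f \in L^p \cap L^q$ and $\nabla f^k \in L^r$. Moreover let $1 \leq p \leq rk \leq dk$, $k < q < rkd/(d-r)$ and

$$\frac{1}{r} - \frac{k}{q} - \frac{s}{d} < 0. \tag{38}$$

Then there exists a constant $C_{GNS}$ which depends on $s, p, q, r, d$ such that

$$\|f\|_{L^q} \leq C_{GNS}\|f\|^{\alpha_2}_{L^p}\|f^k\|^{\alpha_1}_{\dot{W}^{s,r}}, \tag{39}$$

where $0 < \alpha_i$ satisfy

$$1 = \alpha_1 k + \alpha_2, \tag{40}$$

and

$$\frac{1}{q} - \frac{1}{p} = \alpha_1\Big(-\frac{s}{d} + \frac{1}{r} - \frac{k}{p}\Big). \tag{41}$$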
The following lemma verifies that the distributions defined by the second derivatives of admissible kernels behave as expected under mass-invariant scalings.

Lemma 6. Let $\mathcal{K}$ be admissible. Then for all $p$ with $1 < p < \infty$, $u \in L^p$ and $t > 0$, we have

$$\big\|\nabla\big(t^d\nabla\mathcal{K}(t\cdot)\ast u\big)\big\|_p \lesssim_p t\|u\|_p. \tag{42}$$

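Proof. We take the second derivative in the sense of distributions. Let $\phi \in C_c^\infty$, then by the dominated convergence theorem,

\begin{align*}
\int t^d\partial_{x_i}\mathcal{K}(tx)\partial_{x_j}\phi(x)\,dx &= \lim_{\epsilon\to 0}\int_{|x|>\epsilon} t^d\partial_{x_i}\mathcal{K}(tx)\partial_{x_j}\phi(x)\,dx \\
&= -\lim_{\epsilon\to 0} t\int_{|x|=\epsilon} t^{d-1}\partial_{x_i}\mathcal{K}(tx)\frac{x_j}{|x|}\phi(x)\,dS - t\,\mathrm{PV}\!\int t^d\partial_{x_i x_j}\mathcal{K}(tx)\phi(x)\,dx.
\end{align*}

By $\nabla\mathcal{K} \in L^{d/(d-1),\infty}$, we have $\nabla\mathcal{K} = O(|x|^{1-d})$ as $x \to 0$. Therefore for $\epsilon$ sufficiently small, there exists $C > 0$ such that,

\begin{align*}
\Big| t\int_{|x|=\epsilon} t^{d-1}\partial_{x_i}\mathcal{K}(tx)\frac{x_j}{|x|}\phi(x)\,dS \Big| &\leq Ct\int_{|x|=\epsilon}|x|^{1-d}|\phi(x)|\,dS \\
&= Ct\int_{|x|=1}|\epsilon x|^{1-d}|\phi(\epsilon x)|\epsilon^{d-1}\,dS = Ct\int_{|x|=1}|\phi(\epsilon x)|\,dS.
\end{align*}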



The admissibility conditions (BD) and (KN) are sufficient to apply the Calderón-Zygmund theory [44], which implies that the principal value integral in the second term is a bounded linear operator on $L^p$ for all $1 < p < \infty$. The operator norms, which are the implicit constants in (42), only depend on the bound in (BD) and on the condition

$$\int_{|x|>2|y|}|K(x-y) - K(x)|\,dx \leq B,$$

which is implied by (BD) [44]. Both of these conditions are clearly left invariant under the rescaling in (42) and this concludes the proof. □
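The claimed invariance of the integral condition under the rescaling can be checked by a change of variables. In the sketch below, $K_t(x) := t^d K(tx)$ is notation introduced here for the rescaled singular kernel (an assumption about how the rescaling in the lemma acts on $K$), and $x' = tx$, $y' = ty$:

```latex
\begin{align*}
\int_{|x|>2|y|}\big|K_t(x-y) - K_t(x)\big|\,dx
 &= \int_{|x|>2|y|} t^d\,\big|K(tx - ty) - K(tx)\big|\,dx \\
 &= \int_{|x'|>2|y'|}\big|K(x'-y') - K(x')\big|\,dx' \;\le\; B,
 \qquad dx' = t^d\,dx .
\end{align*}
```

The region $\{|x| > 2|y|\}$ is mapped to $\{|x'| > 2|y'|\}$, so the same constant $B$ serves for every $t > 0$.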